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1 A new look at Kleene’s theorem 

Not very many results in computer science are recognised as being as basic and funda¬ 
mental as Kleene’s theorem. It was originally stated as the equality of two sets of objects, 
and is still so, even if the names of the objects have changed — see for instance Theo¬ 
rem 1.4.11 in Chapter 1 of [47]. This chapter proposes a new look at this statement, in 
two ways. First, we explain how Kleene’s theorem can be seen as the conjunction of two 
results with distinct hypotheses and scopes. Second, we express the hrst of these two 
results as the description of algorithms that relate the symbolic descriptions of the objects 
rather than as the equality of two sets. 


A two step Kleene’s theorem In Kleene’s theorem, we hrst distinguish a step that con¬ 
sists in proving that the set of regular (or rational) languages is equal to the set of lan¬ 
guages accepted by finite automata — a set which we denote by Rat A*. This seems 
already to be Kleene’s theorem itself and is indeed what S. C. Kleene established in [34]. 
But it is not, if one considers — as we shall do here — that this equality merely states the 
equality of the expressive power of rational expressions and that of hnite labelled directed 
graphs. This is universally true. It holds independently of the structure in which the labels 
of the automata or the atoms of the expressions are taken, in any monoids or even in the 
algebra of polynomials under certain hypotheses. 

By the virtue of the numerous properties of hnite automata over hnitely generated 
(f.g., for short) free monoids: being apt to determinisation for instance, the family of lan¬ 
guages accepted by such automata is endowed with many properties as well: being closed 
under complementation for instance. These properties are extraneous to the dehnition of 
the languages by expressions, and then — by the former result — to the dehnition by 
automata. It is then justihed, especially in view of the generalisation of expressions and 
automata to other monoids and even to other structures, to set up a dehnition of a new 
family of languages by new means, that will extend in the case of other structures, these 
properties of the languages over f.g. free monoids. It turns out that the adequate dehnition 
will be given in terms of representations by matrices of finite dimension; we shall call the 
languages dehned in that way the recognisable languages and we shall denote their family 
by Rec A*. The second step of Kleene’s theorem consists then in establishing that hnite 
automata are equivalent to matrix representations of hnite dimension under the hypothesis 
that the labels of automata are taken in f.g. free monoids. 

These two steps correspond to two different concepts: rationality for the hrst one, and 
recognisability for the second one. This chapter focusses on rationality and on the hrst 
step, namely the equivalence of expressiveness of hnite automata and rational expressions. 
For sake of completeness however, we sketch in Section 2 how one gets from rational sets 
to recognisable sets in the case of free monoids and in Section 5, we see that the same 
construction fails in non-free monoids and explore what remains true. 


The languages and their representation Formal languages or, in the weighted variant, 
formal power series, are potentially infinite objects. We are only able to compute finite 
ones; here, expressions that denote, or automata that accept, languages or series. Hope¬ 
fully, these expressions and automata are faithful description of the languages or series 
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they stand for, all the more effective that one can take advantage of this double view. 

In order to prove that the family of languages accepted by finite automata coincide 
with that of the languages denoted by rational expressions we proceed by establishing 
a double inclusion. As sketched in Figure l', given an automaton A that accepts a lan¬ 
guage K, we describe algorithms which compute from A an expression F that denotes the 
same language K — I call such algorithms a F -map. Conversely, given an expression E 
that denotes a language L, we describe algorithms that compute from E an automaton B 
that accepts the same language L — I call such algorithms a A-map. 

Most of the works devoted to the conversion between automata and expressions ad¬ 
dress the problem of the complexity of the computation of these F- and A-maps. I have 
chosen to study here the maps for themselves, how the results of different maps applied to 
a given argument are related, rather than to describe the way they are actually computed. 
The F-maps are considered in Section 3, the A-maps in Section 4. 



The path to generalisation The main benefit of splitting Kleene’s theorem into two 
steps is to bring to light that the first one is a statement whose scope extends much be¬ 
yond languages. It is first generalised to subsets of arbitrary monoids and then, with some 
precaution, to subsets with multiplicity, that is, to (formal power) series. This latter exten¬ 
sion of the realm of Kleene’s theorem is a matter for the same ‘splitting’ and distinction 
between series on arbitrary monoids and series on f.g. free monoids. 

It would thus be possible to first set up the convenient and most general structure and 
then state and prove Kleene’s theorem in that framework. My experience, however, is that 
many readers tend to be repelled and flee when confronted with statements outside the 
classical realm of words, languages, and free monoids. This is where I stay in the first 
three sections of this chapter. The only difference with the classical exposition will be in 
the terminology and notation that will be carefully chosen or coined so that they will be 
ready for the generalisation to arbitrary monoids in Section 5 and to series in Section 6. 

Notation and definitions given in Chapter 1 are used in this chapter without comment 
when they are refered to under the same form and with the exact same meaning. 


(A*) denotes the power set of A*, that is, the set of all languages over A*. 
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2 Rationality and recognisability 

We first introduce here a precise notion of rational expression, and revisit the definition of 
finite automata in order to fix our notation and to state, under the form that is studied here 
and eventually generalised later, what we have called above the ‘first step of Kleene’s 
theorem’ and which we now refer to as the Fundamental theorem of finite automata. 
Second, we state and prove ‘the second step’ of Kleene’s theorem in order to make the 
scope and essence of the first step clearer by contrast and difference. 


2.1 Rational expressions 

The set of rational languages of A*, denoted by Rat A* is the smallest subset of (A*) 
which contains the finite sets (including the empty set) and is closed under union, product, 
and star. A precise structure-revealing specification for building elements of this family 
can be given by rational expressions. 

Definition 2.1. A rational expression over A* is a well-formed formula built inductively 
from the constants 0 and 1 and the letters a in A as atomic formulas, using two binary 
operators + and • and one unary operator : if E and F are rational expressions, so are 
(E + F), (E • F), and (E*). We denote by RatE A* the set of rational expressions over A* 
and often write expression for rational expression. (As in [54], ‘rational expression’ is 
preferred to the more traditional regular expression for several reasons and in particular 
as it will be used in the weighted case as well, see Section 6.) 

With every expression E in RatE A* is associated a language of A*, which is called 
the language denoted by E and we write^ it as |E|. The language |E| is inductively defined 
by^ |0| = 0, |1|= {1^.} , |o|= {a} for every a in A, |(E + F)| = |E|u|F|, |(E • F)| = |E||F|, 
and |(E*)| = {|E|}* . Two expressions are equivalent if they denote the same language. 

Proposition 2.1. A language is rational if and only if it is denoted by an expression. 

Like any formula, an expression E is canonically represented by a tree, which is called 
the syntactic tree of E. Let us denote by (. (E) the literal length of the expression E (that 
is, the number of all occurences of letters from A in E) and by d (E) the depth of E which 
is defined as the depth — or height‘s— of the syntactic tree of the expression. 

The classical precedence relation between operators: ‘ > • > -F ’ allows to save 

parentheses in the writing of expressions: for instance, E + F • G* is an unambiguous 
writing for the expression (E 4- (F • (G*))). But one should be aware that, for instance, 
(E • (F • G)) and ((E • F) • G) are two equivalent but distinct expressions. In particular, the 
derivation that we define at Section 4 yields different results on these two expressions. 


^The notation Z,(E) is more common, but |E| is simpler and more appropriate when dealing with expressions 
over an arbitrary monoid or with weighted expressions. 

^The empty word of A* is denoted by 1^* ■ 

“^We rather not use height because of the possible confusion with the star height, cf. Section 4. 
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In the sequel, any operator defined on expressions is implicitely extended additively 
to sets of expressions. For instance, it holds: 

VXCRatEA* m=UEeJf|E|. 

Definition 2.2. The constant term of an expression E over A* , written c( E), is the Boolean 
value, inductively defined and computed using the following equations: 


c(0) = 0 , c(l) = 1, Vo € A c(a) = 0 , 
c(F + G) = c(F) + c(G), c(F • G) = c(F)c(G), c(F*) = 1 . 

The constant term of a language L of A* is the Boolean value c{L) that is equal to 1 
if and only if Ia-^ belongs to L. By induction on d(E), c(E) = c(|E|) holds. 


2.2 Finite automata 


We denote an automaton over A* hy A = {Q, A, E,I,T) where Q is the set of states, 

and is also called the dimension of A, I and T are subsets of Q, and E C Qx AxQ 

is the set of transitions labelled by letters of A. The automaton A is finite if E is finite, 

hence, if A is finite, if and only if (the useful part of) Q is finite. 

A computation in A from state p to state q with label w is denoted by p q . The 

A 

language accepted^ by A, also called the behaviour of A, denoted by |^|, is the set of 
words accepted by A, that is, the set of labels of successful computations: 


\A\=[wgA* 


3i € 1,3t €T 



The first step of Kleene’s theorem, which we call Fundamental theorem of finite automata 
then reads as follows. 

Theorem 2.2. A language of A* is rational if and only if it is the behaviour of a finite 
automaton over A*. 

Theorem 2.2 is proved by building connections between automata and expressions. 

Proposition 2.3 (F-maps). For every finite automaton A over A*, there exist rational 
expressions over A* which denote\A\ 

Proposition 2.4 (A-maps). For every rational expression E over A*, there exist finite 
automata over A* whose behaviour is equal to |E|. 

Section 3 describes how expressions are computed from automata. Section 4 how 
automata are associated with expressions. Before going to this matter, which is the main 
subject of this chapter, let us establish the second step of Kleene’s theorem. 

prefer not to speak of the language ‘recognised’ by an automaton, and I would not say that a language is 
‘recognisable’ when accepted by a finite automaton, in order to have a consistent terminology when generalising 
automata to arbitrary monoids. 
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2.3 The ‘second step’ of Kleene’s theorem 

Let us first state the definition of recognisable languages, under the form that is given for 
recognisable subsets of arbitrary monoids {cf. Section 1.5.2). 

Definition 2.3. A language L of A* is recognised by a morphism a from A* into a 
monoid N if L = a~^{a{L)) . A language is recognisable if it is recognised by a mor¬ 
phism into a finite monoid. The set of recognisable languages of A* is denoted by Rec A*. 

Theorem 2.5 (Kleene). If A is a finite alphabet, then Rat A* = Rec A*. 

The proof of this statement paves the way to further developments in this chapter. Let 
A = {Q,A,E,I,T) be a finite automaton. The set E of transitions may be written as 
a Qx Q-matrix, called the transition matrix of A, also denoted by E, and whose {p, q)- 
entry is the set (the Boolean sum) of letters that label the transitions from pio q in A. A 
fundamental (and well-known) lemma relates matrix multiplication and graph walking. 

Lemma 2.6. Let E be the transition matrix of the automaton A of finite dimension Q. 
Then, for every n in N, E'^ is the matrix of the labels of paths of length n in A: 

= {w&A^\pA^qY 

The subsets / and T of Q may then be seen as Boolean vectors of dimension Q (I as 
a row and T as a column-vector). From the notation E* = X^neN follows: 

\A\=I-E*-T. (2.1) 

The next step in the preparation of the proof of Theorem 2.5 is to write the transition 
matrix E as a formal sum E = Xoe.4 /^(®) ® ’ where for every a in A, ii{a) is a Boolean 
Q X Q-maixix. These matrices p{a) define a morphism p: A* ^ . The second 

lemma involves the freeness of A* and reads: 

Lemma 2.7. Let p: A* ^ be a morphism and let E = XasA M(o)a. Then, for 

every n in N, = XujgA" m(^) tv and thus E* = XtueA* Ti^v) w . 

Proof of Theorem 2.5. By Theorem 2.2, a rational language L of A* is the behaviour of a 
finite automaton A= {Q, A, E,I,T) .By (2.1) and Lemma 2.6, we write 

L = \A\ = {w € A* \ I ■ p{w) • T = 1} . 

and thus L = p~^{S) where S = {m € | I ■ m ■ T = l} and L is recognisable. 

Conversely, let L be a recognisable language of A*, recognised by the morphism 
a: A* N and let S = a{L). Consider the automaton Aa = {N, A, E, {1^}, S) 
where E = { (n, a,na{a)) | a G A, n £ A^} . It is immediate that 

\Aa\ = G A* 3p G S In —^ = {tc G A* | a{w) G S} = a~^{S) = L 

and L is rational by Theorem 2.2. □ 
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We postpone to Section 5 the example that shows that recognisability and rationality 
are indeed two distinct concepts and the description of the relationships that can be found 
between them. As mentioned in Chapter 1, the following holds. 

Theorem 2.8. The equivalence of finite automata over A* is decidable. 

Proposition 2.4 then implies: 

Corollary 2.9. The equivalence of rational expressions over A* is decidable. 


3 From automata to expressions: the F-maps 

For the rest of this section, A — {Q,A,E,I,T) is a finite automaton over A*, and E 
is viewed, depending on the context, as the set of transitions or as the transition matrix 
of A. As in (2.1), the language accepted by A is conveniently written as 

In order to prove that |.4| is rational, it is sufficient to establish the following. 

Proposition 3.1. The entries of E* belong to the rational closure of the entries of E. 

But we want to be more precise and describe procedures that produce for every entry 
of E* a rational expression whose atoms are the entries of E (and possibly 1). There are 
(at least) four classical methods to proving Proposition 3.1, which can easily be viewed 
as algorithms serving our purpose and which we present here: 

(1) Direct computation of |.4|: the state-elimination method looks the most elementary 
and is indeed the easiest for both hand computation and computer implementation. 

(2) Computation of i7* • T as a solution of a system of linear equations. Based on 
Arden’s lemma, it also allows to consider E* -T as a fixed point. 

(3) Iterative computation of E*: known as McNaughton-Yamada algorithm and prob¬ 
ably the most popular among textbooks on automata theory. 

(4) Recursive computation of E*: based on Arden’s lemma as well, this algorithm 
combines mathematical elegance and computational inefficiency. 

The first three are based on an ordering of the states of the automaton. For comparing 
the results of these different algorithms, and of a given one when the ordering of states 
varies, we first introduce the notion of rational identities, together with the key Arden’s 
lemma for establishing the correctness of the algorithms as well as the identities. The sec¬ 
tion ends with a refinement of Theorem 2.2 which, by means of the notions of star height 
and loop complexity, relates even more closely an automaton and the rational expressions 
that are computed from it. 


3.1 Preparation: rational identities and Arden’s lemma 

By definition, all expressions which denote the behaviour of a given automaton A are 
equivalent. We may then ask whether, and how, this equivalence may be established 
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within the world of expressions itself. We consider ‘elementary equivalences’ of more or 
less simple expressions, which we call rational identities, or identities for short, and which 
correspond to properties of (the semiring of) the languages denoted by the expressions. 
And we try to determine which of these identities, considered as axioms, are necessary, 
or sufficient, to obtain by substitution one expression from another equivalent one. It is 
known — and out of the scope of this chapter — that no finite sets of identities exist that 
allow to establish the equivalence of expressions in general (see Chapter 20). We shall 
see however that a basic set of identities is sufficient to deduce the equivalence between 
the expressions computed by the different F-maps described here. 

Trivial and natural identities A first set of identities, that we call trivial identities, 
expresses the fact that 0 and 1 are interpreted as the zero and unit of a semiring; 

E+0 = E, 0+E = E, E-0 = 0, 0-E = 0, E-1 = E, 1-E = E, 0* = 1 (T) 

An expression is said to be reduced if it contains no subexpressions which is a left-hand 
side of one of the above identities; in particular, 0 does not appear in a non-zero reduced 
expression. Any expression H can be rewritten in an equivalent reduced expression H'; 
this H' is unique and independent of the way the rewriting is conducted. From now on, all 
expressions are implicitely reduced, which means that all the computations on expressions 
that will be defined below are performed modulo the trivial identities. 

The next set of identities expresses the fact that the operators -|- and • are interpreted 
as the addition and product in a semiring with their associativity, distributivity and com¬ 


mutativity properties: 

(E+F)-eg = E+(F +G) and (E • F) • G = E • (F • G) , (A) 

E • (F -E G) = E • F + E • G and (E + F) • G = E • G + F • G , (D) 

E+F = F + E . (C) 

The conjunction A A D A C is abbreviated as N and called the set of natural identities. 

Aperiodic identities The product in Cp (A*) is distributive over infinite sums; then 

K* = 1a-+ K*K = 1a-+ KK* , (3.1) 

from which we deduce the identities: 

E* = 1 -E E • E* and E* = 1 -E E* • E . (U) 


From (U) and the gradation® of A* follows Arden’s lemma whose usage is ubiquitous. 

Lemma 3.2 (Arden). Let K and L be two subsets of A*. Then K*L is a solution, 
K*L is the unique solution if c{K) = 0, of the equation X = AX -E L. 

For computing expressions, we prefer to use Arden’s lemma under the following form: 

Corollary 3.3. Let K and L be two rational expressions over A* with c(K) = 0. Then, 
K*L denotes the unique solution of IL = |K|X -E|L|. 

^That is, the elements of A* have a length which is a morphism from A* onto N {cf. Section 6). 
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The next two identities, called aperiodic identities, are a consequence of Lemma 3.2. 

Proposition 3.4. For all rational expressions E and F over A* 

(E + F)* = E*.(F-E*)* and (E + F)* = (E* • F)* • E* , (S) 

(E • F)* = 1 + E - (F • E)* • F . (P) 

There are many other (independent) identities {cf. Notes). The remarquable fact is 
that those listed above will be sufficient for our purpose. 

Identities special to (A*) Finally, the idempotency of the union in (A*) yields two 
further identities: 

E + E = E , (I) (E*)* = E* . (J) 

In contrast with the preceding ones, these two identities (I) and (J) do not hold for 
expressions over arbitrary semirings of formal power series {cf. Section 6). 


3.2 The state-elimination method 

The algorithm known as state-elimination method, originally due to Brzozowski and Mc- 
Cluskey [13], works directly on the automaton A = {Q,A,E,I,T). It consists in sup¬ 
pressing the states in A, one after the other, while transforming the labels of the transitions 
so that the language accepted by the resulting automaton is unchanged (cf. [61, 62]). 

A current step of the algorithm is represented at Figure 2. The left diagram shows the 
state q to be suppressed, a state pi which is the origin of a transition whose end is q and a 
state Tj which is the end of a transition whose origin is q (it may be the case that pi = rf). 
By induction, the labels are rational expressions. The right diagram shows the automaton 
after the suppression of q, and the new label of the transition frompi to rj. The languages 
accepted by the automaton before and after the suppression of q are equal — a formal 
proof will follow in the next subsection. 



Figure 2. One step in the state-elimination method 

More precisely, the state-elimination method consists first in augmenting the set Q 
with two new states i and t, and adding transitions labelled with 1 from i to every initial 
state of A and from every final state of A to t. Then all states in Q are suppressed 
according to the procedure described above and in a certain order w(that can be decided 
beforehand or determined step by step). At the end, only remain states i and t, together 
with a transition from itot labelled with an expression which we denote by {A) and 
which is the result of the algorithm. Thus it holds: 

\A\ = \B^{A)\ . 
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Figure 3 shows every step of the state-elimination method on the automaton P 3 drawn 
in the upper left corner and following the order uji = r < p < q. It shows the result 
^bJi{T^z) = a*b{ba*b + ab*a)*ba* + a* . The computation of B^(^) may silently 
involve identities in N. A common and natural way of performing the computation is to 
use identities I and J as well; it yields simpler results. It is then to be stressed that the use 
of I and J is not needed to establish these equivalence results. 

The effect of the order The result of the state-elimination method obviously depends 
on the order w in which the states are suppressed. For instance, on the automaton X >3 of 
Figure 3, the other order uj 2 = r < q < p yields B^ 2 (T> 3 ) = (a -f b{ab*a)*b)*, and 
UI 3 = p < q < r yields 

B ^3 {V 3 ) = a* + a*b{ba*b)*ba* + a*b{ba*b)*a(b + a {ba*b)*a)*a (ba*b)*ba*. 



Theorem 3.5 (Conway [18], Krob [35]). Let oj and oj' be two orders on the set of states 
of an automaton A. Then, NASAP I— Bcj(aI) = Bcj'(aI) holds. 


Proof We can go from any order w to any other order w' on Q by a sequence of transpo¬ 
sitions. We therefore arrive at the situation illustrated in Figure 4 (left) and need to show 
that the expressions obtained when we first remove the state r and then r' are equivalent 
to those obtained from removing first r' and then r, modulo SAP (without mentioning 
the natural identities). 



Figure 4. First step of two in the state-elimination method 


The removal of state r gives the expressions in Figure 4 (right). The removal of state r' 
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gives the expression; 

E = KL*H + (KL*G + K') [G'L*G + L']* (G'L*H + H') , 
which using S (and the natural identities) becomes: 


E = KL*H + KL*G [L'*G'L*G]* L'*G'L*H 

+ K' [L'*G'L*G]* L'*G'L*H + KL*G [L'*G'L*G] * L'*H' 

+ K' [L'*G'L*G]*L'*H'. 

We write: 


K' [L'*G'L*G]*L'*H' 


K'L'*H' + K'L'*G'L* [GL'*G'L*]* 


GL'*H' 


by using P, and then, by‘switching the brackets’(using the identity (XY)*X = X(YX)* 
which is also a consequence of P), we obtain: 


E = KL*H 

+ KL*G [L'*G'L*G]* L'*G'L*H + K'L'*G' [L*GL'*G']* L*H 
+ KL*G [L'*G'L*G]*L'*H' + K'L'*G' [L*GL'*G']* L*GL'*H' 




an expression that is perfectly symmetric in the letters with and without ‘primes’, which 
shows that we would have obtained the same result if we had started by removing r' 
then r. □ 


Aside from the formal proximity between expressions obtained from a given automa¬ 
ton, the question of the length of these expressions is of course of interest, both from a 
theoretical as well as practical point of view. The above example V is easily generalised 
so as to find an exponential gap between the length of expressions for two distinct orders. 
The search for short expressions is performed by heuristics, with more or less degree of 
sophistication (see Notes). 


3.3 The system-solution method 

The computation of an expression that denotes the language accepted by a finite automa¬ 
ton as the solution of a system of linear equations is nothing else than the state-elimination 
method turned into a more mathematical setting, which allows then easier formal proofs. 


Description of the algorithm Given A = {Q,A,E,I,T), for every p in Q, we 
write Lp for the set of words which are the label of computations from p to a final state 


of A: 


Lp — 


w 


G A* 


G T 



For a subset R of Q, we write the symbol 
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(5pfor 1 if p is in i? and 0 if not. The system of equations associated with A is written: 


i“4|— E/ ~ E/ ^pd ^p 

(3.2) 

pel peQ 


yp € Q Lp = E! |Fp,g|^9 +I'^p,t| 

(3.3) 

qeQ 



where the Lp are the ‘unknowns’ and the entries which represent subsets of A, as 
expressions Ep^q are sums of letters labelling paths of length 1. The system (3.3) may be 
solved by successive elimination of the unknowns. The pivoting operations, which involve 
subtraction and division that are not available in the semiring ^ (^*), are replaced by the 
application of Arden’s lemma, since c(Ep q) = 0 for all p, q in Q. 

After the elimination of a certain number of unknowns Lp — we write Q' for the set 
of indices of those which have not been eliminated — we obtain a system of the form: 

W = E|Gpl^t>+|H| (3-4) 

peQ' 

Vp e Q' ip = E + |Kp| (3.5) 

q&Q' 

If we choose (arbitrarily) one element q in Q', Corollary 3.3 applied to the corresponding 
equation from the system (3.5), yields: 

~\^q,q\ I E/ l^9.p|ip+ 1 ^ 9 ! I (3-6) 

\peQ'\q / 

which allows the elimination of Lq in (3.4)-(3.5) and gives: 

W= E + + (3.7) 

rGQ'\q 

\/r€Q'\p Lr = E (|Fr,p + F* ,jFr,p|) Tp+|Kr + F* . (3.8) 

P6Q'\9 

When all unknowns Lq have been eliminated in the ordering w on Q, the computation 
yields an expression that we denote by E^(^) and (3.7) becomes: 

W = |E.(^)I ■ 

As for the state-elimination method, the identities N (and I and J) are likely to have been 
involved at any step of the computation of E^(^). 

Comparison with the state-elimination method The state-elimination method and the 
system-solution are indeed one and the same algorithm for computing the language ac¬ 
cepted by a finite automaton, as stated by the following. 

Proposition 3.6 ([54]). For any order w on the states of A it holds: 

bua) = -eua) . 
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Proof. We can build a generalised automaton B' corresponding to the system (3.4)-(3.5), 
with set of states is Q' U {i, f], where i and t do not belong to Q', and such that, for all p 
and q in Q'\ 

(i) the transition from p to g is labelled 

(ii) the transition from p to f is labelled Kp, 

(iii) the transition from i to p is labelled Gp; and 

(iv) the transition from i to f is labelled H. 

Note that this definition applied to the system (3.2)-(3.3) characterises the automaton 
constructed in the first phase of the state-elimination method applied to A. 

The elimination in the system (3.4)-(3.5) of the unknown Lq by substitutions and 
the application of Arden’s lemma give the system (3.7)-(3.8) whose coefficients are ex¬ 
actly the transition labels of the generalised automaton obtained by removing the state q 
from B'. 

Thus, since the starting points correspond and since each step maintains the corre¬ 
spondence, the expression obtained for |^| by the state-elimination method is the same as 
that obtained by the solution of the system (3.2)-(3.3). □ 

The state-elimination method reproduces, in the automaton A, the computations cor¬ 
responding to the solution of the system: the latter is s. formal proof of the former. As 
another consequence of Proposition 3.6, the following corollary of Theorem 3.5 holds: 

Corollary 3.7. Let uj and to' be two orders on the set of states of an automaton A Then, 
NASAP I— E^(.4) =Eu:'{A) . 


3.4 The McNaughton-Yamada algorithm 

Given A = {Q, A, E, I,T), the McNaughton-Yamada algorithm ([43]) — called here 
MN-Y algorithm for short — truly addresses the problem of computing the matrix E*, 
whereas the two preceding methods rather compute the sum of some of the entries of E*. 
Like the former methods, it relies on an ordering of Q but it is based on a different group¬ 
ing of computations^ within A. 


Description of the algorithm We write Mp^q for (E*)p q: 


Mp^q =\w& A* 



The set Q ordered by w is identified with the set of integers from 1 to n = Card (Q). 
The key idea of the algorithm is to group the set of paths between any states p and g in Q 
according to the highest rank of the intermediate states. We denote by Mp^q the set of 
labels of paths from p to g which do not pass through intermediate states of rank greater 
than k. And we shall compute expressions such that |Mp^g| = Mp^q . 


^In order to avoid confusion between the computations of expressions that denote the language accepted 
by A and whose variations are the subject of the chapter, and the computations within A, which is the way we 
call the paths in the labelled directed graph A, we use the latter terminology in this section. 
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A path that does not pass through any intermediate state of rank greater than 0 passes 
through no intermediate states, and therefore reduces to a single transition. Thus, Mp^q is, 
for all p and q in Q, the set of labels of transitions which go from p to q; that is, Mp^q = 
Ep^q and Mp°g = Ep ^ . A path which goes from p to q without visiting intermediate 
states of rank greater than k is: 

(a) either a path (from p to q) which does not visit intermediate states of rank greater 
than fc — 1 ; 

(b) or the concatenation: 

• of a path from pto k without passing through states of rank greater than k—V, 

• followed by an arbitrary number of paths which go from k to k without passing 
through intermediate states of rank greater than A: — 1; 

• followed finally by a path from k to q without passing through intermediate states 
of rank greater than k — 1. 

This decomposition is sketched in Figure 5 and implies that for all p and q in Q, for 
all k ^ n, it holds: 


p,q p,q ' 


'p,k 




'k,q 


The algorithm ends with the last equation: 

if P^q, Mp., = + 1 if p = q . 

For consistency with the previous sections, we write Mi^(,4,) = X^pe/ q^T *'■ 

holds: 

N = |MX^)|. 



Figure 5. Step k of MN-Y algorithm 


Example 3.1. The MN-Y algorithm applied to the automaton TZi of Figure 6 yields the 
following matrices (we group together, for each k, the four Mp^] into a matrix 


= 


= 


a{a)*a 
■ a{a)*a 


a{a)*b 

a{a)*b 


fa + a{a)*a + (5 + a{a)*b){b + a{a)*b)*{a + a{a)*a) 

\a + a{a)*a + {b + a(a)*5)(5 + a(a)* 6 )*(a + a(a)*a) 

(6 + a{a)*h) + (6 + a{a)*b){b + a{a)*b)*{b + a{a)*b)\ 
{b + a{a)*b) + {b + a{a)*b){b + a{a)*b)*lb + a{a)*b) J 
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As in the first two methods, identities in N (as well as I and J) are likely to be used at 
any step of the MN-Y algorithm. What is new is that identities D and U are particularly 
fit for the computations involved in the MN-Y algorithm. For instance, after using these 
identities, the above matrices become; 




fa*a a*b\ ( 2 ) _ /(a*6)*a*a {a*b)*a*b\ 

\a*a a*bj ~ \{a*b)*a*a {a*b)*a*bj 



Figure 6. The automaton TZi 


Comparison with the state-elimination method Comparing the MN-Y algorithm with 
the state-elimination method amounts to relating two objects whose form and mode of 
construction are rather different: on the one hand, a Q x Q-matrix obtained by successive 
transformations and on the other hand, an expression obtained by repeated modifications 
of an automaton, hence of a matrix, but one whose size decreases at each step. This leads 
us to a more detailed statement. 

Proposition 3.8 ([54]). Let A= {Q, A, E, I ,T) be an automaton and for every p and q 
inQ, let Ap^q be the automaton defined by Ap^q = {Q^A,E,{p\,{q}). For every (total) 
order to on Q and every p and q in Q, it holds: 

NAU I- M^{Ap,q) =BUAp,q) . 

Proof In the following, A and uj are fixed and remain implicit. The automaton A has n 
states, identified with the integers from 1 to n; the two algorithms perform n steps starting 
in a situation called ‘step O’, the fcth step of the state-elimination method consisting of the 
removal of state k, and that of algorithmMN-Y consisting of calculating the labels of 
paths that do not include nodes (strictly) greater than k. We write: 

EW(r,s) 

for the label of the transition from r to s in the automaton obtained from A (and w) at 
the kth step of the state elimination method; necessarily, in this notation, k + 1 ^ r and 
k + 1 ^ s (abbreviated to k + 1 ^ r, s). As above, we write: 



for the entry r, s of the nxn matrix computed by the fcth step of MN-Y algorithm. At 
step 0, the automaton A has not been modified and we have; 

Vr, s , 1 ^ r, s ^ n (r, s) , (3.9) 

which will be the base case of the inductions to come. The MN-Y algorithm is written: 

V/c, 0 < k ^ n , Vr, s , 1 ^ r, s ^ n 

• (Mi';'))* . . (3.10) 
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The state-elimination algorithm is written; 

Vfc , 0 < k ^ n, Vr, s , k < r, s ^ n 

E('=)(r,s) = E('=-i)(r,s) + E('=-i)(r,fc) • (fc, fc))* • E('=-i)(fc,s) (3.11) 

Hence we conclude, for given r and s and by induction on k: 

Vr, s , 1 < r, s < n , V/c, 0 ^ fc < min(r, s) = E*-^^ (r, s) (3.12) 

We see in fact (as there is even so something to see) that if fc < min(r, s) then all integer 
triples {I, u, v) such that M^t, occurs in the computation of by the (recursive) use 
of (3.10), are such that ( < min(M, v). 

Suppose now that we have p and q, also fixed, such that 1 ^ p < q ^ n (the other 
cases are dealt with similarly). We call the initial and final states added to A in the 
first phase of the state-elimination method i and t respectively; i and t are not integers 

between 1 and n. The transition from z to p and that from g to f are labelled I a* ■ Now let 

(v) 

us consider step p of each algorithm. For every state s,p < s, is given by (3.10): 

M(p) = -f • M(p“1 ) 

' 'p,s ' 'p,S ' P,P I I P,S 

and E(p)(z, s) by: 

E(P)(z,s) = (e(p-i)(p,p))*.E(p-i)(p,s) 

and hence, by (3.12): 

Vs,p<s^n U I— M^Pl = E^P\i,s) . (3.13) 

Next we consider the steps following p (and row p of the matrices M (*)). For all k,p <k, 
and all s. A: < s ^ n, IVIp^i is always computed by (3.10) and E^^^ (z, s) by: 

E('=)(i,s) = 4- E^'^-^\i,k) ■ (E('=-i)(fc, fc)) * • E('=-i)(A:, s) . (3.14) 

From (3.13), and based on an observation analogous to the previous one, we conclude 
from the term-by-term correspondence of (3.10) and (3.14) that: 

yk,p<k,Vs,p<si^n Ul— = E('=)(i,s) . (3.15) 

The analysis of step q gives a similar, and symmetric, result to that which we have just 
obtained from the analysis of step p: for all r, q < r, we have; 

M(9) = 

' 'r,q ' 'r,q ' ' 'r,q \ Q,Q I 

and E(«)(r,f) = E^‘i-^\r,q) • g)) * 

and hence 

Vr,g<r<n U I— M<,«^ = E(«)(r,f) . (3.16) 

The steps following g give rise to an equation symmetric to (3.15) (for column g of the 
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matrices M 

yk,q<k,yr,q<ri^n Ul— M^ (r,t) . (3.17) 

Finally, from: 

■ (mIa'0* • 

and EW(z,f) = E('=-i)(i,f) + E^'^-^\i,k) ■ (fc, fc))* • E^'^-^\k,t) 

Equations (3.12), (3.15) and (3.17) together allow us to conclude, by induction on k, that: 

yk,qi^k^n Ul— . (3.18) 

When we reach fc = n in this equation we obtain the identity we want. □ 

As a consequence of Proposition 3.8, we have the following corollary of Theorem 3.5: 

Corollary 3.9. Let w and ui' be two orders on the states of an automaton A. Then, 
NASAP I— M„(a1) = M,,-(.4) . 


3.5 The recursive method 

The last method we want to quote appeared hrst in Conway’s book Regular Algebra and 
Finite Machines [18] which gave a new start to the formal study of rational expressions 
{cf. Chapter 20). Originally, it yields a proof of Proposition 3.1(the entries of E* belong 
to the rational closure of the entries of E). As we did above, we modify it so as to make it 
compute from E, a matrix of rational expressions which denotes E, a matrix E' of rational 
expressions which denotes the matrix E*. 


Description of the algorithm The recursive method (for computing E', our goal) is 
based on computation on matrices via bloc decomposition. 

Let M and M' be two Q x Q-matrices (over any semiring indeed) and let Q be the 
disjoint union of R and S. Let us write their bloc decomposition according to Q = RU S 
as: 


M = 






where F and F' are R x i?-matrices, K and K' are S x 5'-matrices, G and G' Rx S- 
matrices, and E[ and H' S x i?-matrices. The bloc decomposition is consistent with the 
matirx operations in the sense that we have: 


M + M' 


fF + F' G + G'\ 
\H + H' K + K'J 


M ■ M' = 


fF-F' + G-H' 
\H ■ F' + K ■ H' 


F-G' + G-K'\ 
H ■ G' + K ■ K') 
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Let us write a block decomposition of E and the corresponding ones for E and E*: 


E = 






E* = 


U V 

w z 


where E and K (and thus F, K, E and Z) are square matrices. By (3.1)*, it follows that 


E* 








an equation which can be decomposed into a system of four other equations: 


[/= 1+|F|C/+|G|VL , Z = 1 +|H|y+|K|Z , 

V=\f\V+\G\Z, and IL = |H|{7 + |K| VL . 


Corollary 3.3 applies to (3.20) and then, after substitution, to (3.19) gives: 
V=\f\*\G\Z and W =\K\*\H\U . 

U = (|F| + |G||Kr|Hr and Z = (|K| + |H||Fr|Gr • 


(3.19) 

(3.20) 


This procedure leads to the computation of E* by induction on its dimension. By the 
induction hypothesis, obviously fulfilled for matrices of dimension 1, |F|* and |K|* are 
denoted by matrices of rational expressions F' and K'. Let us write 


f (F + GK'H)* F'G(K +HF'G)*\ 
\^K'H(F + GK'H)* (K + HF'G)* ) 


and |E'| = E* holds. Another application of the induction hypothesis to |(F + G K' H)|* 
and |(K + FI F' G)|* shows that the entries of E', which we denote by C7-(7l) , where r is 
the recursive division of Q used in the computation, are all in RatE A*. 


Example 3.2. The recursive method applied to the automaton TZi of Example 3.1 (cf. Fig¬ 
ure 6) directly gives (there is no choice for the recursive division): 


r {a + b{b)*a)* a*b{b + a{a)*b)*\ 

\b*a{a + b{b)*a)* {b + a{a)*b)* J 


Comparison with the state-elimination method Both the recursive method and the 
MN-Y algorithm yield a matrix of expressions. Example 3.2 shows that there is no hope 
for an easy inference of teh equivalence of the two matrices. We state however the fol¬ 
lowing conjecture. 

Conjecture 3.10. Let A = {Q, A, E, I,T) be an automaton. For every recursive divi¬ 
sion T of Q and for every pair (p, q) of states, there exists an ordering u) of Q such that: 

NAU ^ . 

More generally, and as a conclusion of the description of these four methods, one 
would conjecture that the rational expressions computed from a same finite automaton 

^applied to matrices with entries in tp (A*) rather than to elements of (A*). 
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are all equivalent modulo the natural identities and the aperiodic ones S and P. Even 
if computed from is not formal enough, the above developments should make the general 
idea rather clear. 


3.6 Star height and loop complexity 

The purpose of this last subsection is to present a refinement of Kleene’s Theorem — 
or, rather, of the Fundamental Theorem of Finite Automata — which relates even more 
closely than above an automaton and the rational expressions that are computed from it. 

Among the three rational operators +, • and *, the operator * is the one that ‘gives 
access to the infinite’, hence the idea of measuring the complexity of an expression by 
finding the degree of nestedness of this operator, a number called star height. On the 
other hand, it is the circuits in a finite automaton that produce an infinite number of 
computations, ‘all the more’ that the circuits are more ‘entangled’. The intuitive idea of 
entanglement of circuits will be captured by the notion of loop complexity. We show how 
the loop complexity of an automaton to the star height of an expression that is computed 
from this automaton, a result which is due originally to Eggan ([21]). 

Star height of an expression Fet E be an expression over A*. The star height of E, 
denoted by h[E], is inductively defined by 

if E = 0,E = lorE = aGA, h[E] = 0 , 

if E = E' +E" or E = E'• E", h[E] = max(h[E'], h[E"]) , 

if E = F*, h[E] = l + h[F] . 


Example 3.3. (i) h[(a + 5)*] = l; h[a* (6a*)*] = 2 . 

(ii) The heights of the three expressions computed for the automaton at Section 3.2 
are: h[B^, ( 223 )] = 2 , h[B^, ( 223 )] = 3, and h[B„ 3 ( 223 )] = 3 . 

These examples draw attention to the fact that two equivalent expressions may have 
different star heights and that star height is unrelated to the length. They also naturally 
give rise to the so-called star-height problem. As it does not directly pertain to the matter 
developed in this chapter, we postpone the few indications we give on this problem to the 
Notes section ((see also Chapter 5). 

Loop complexity of an automaton Let A — {Q,A,E,I,T) be an automaton; we 
call balls'^ the strongly connected components of A that contain at least one transition. 
In other words, a strongly connected component that contains at least two states is a ball, 
and a strongly connected component reduced to a single state s is a ball if and only if s is 
the source (and the destination) of at least one loop. Balls are pairwise disjoint but do not 
form a covering (hence a partition) of Q since a state may belong to no ball (cf. Figure 7). 


^Translation of the French: pelote. 
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Figure 7. An automaton, its strongly connected components and its two balls 


Definition 3.1. The loop complexity of an automaton A is the integer lc(^) defined 
inductively by the following equations: 

lc(^) =0 if ^ contains no balls (in particular if A is empty); 

lc(.4) = max {lc(7^) | V a ball in A} if A is not strongly connected; 

lc(.4) = 1 + min {lc(.4 \ {s}) | s state of .4} if A is strongly connected. 

Figure 8 shows automata with loop complexity 1, 2 and 3 respectively. 


(a) lc{.4) = 1 



(b) lc(A) = 2 



(c) lc(.4) = 3 


Figure 8. Automata with differents loop complexities 


Eggan’s Theorem We are now ready to state the announced refinement of the Funda¬ 
mental theorem of finite automata. 

Theorem 3.11 (Eggan [21]). The loop complexity of a trim automaton A is the minimum 
of the star height of the expressions computed on A by the state-elimination method. 

This theorem may be proved by establishing a more precise statement which involves 
a refinement of the loop complexity and which we call the loop index. 

Let A = {Q,A,E,I,T) be an automaton. If uj is an order on Q, we write ZJ for the 
greatest element of Q according to w. If <S is a sub-automaton of A, we also write cj for the 
trace of the order u over the set R of states of S and, in such a context, oJ for the greatest 
element of R according to oj. Then, the loop index of A relative to oj, written '3u){A), is 
the integer inductively defined by the following: 

• if ^ contains no ball, or is empty, then 


=0 ; 

(3.21) 

• if .4 is not itself a ball, then 


— max V ball in .4}) ; 

(3.22) 

• if ^ is a ball, then 


4cj(.4) = 1 -f \ w) . 

(3.23) 
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The difference with respect to loop complexity is that the state that we remove from a 
strongly connected automaton (in the inductive process) is fixed by the order w rather 
than being the result of a minimisation. This definition immediately implies that 

Property 3.12. Ic(.4) = min | w is an order on Q } . □ 

Theorem 3.11 is then a consequence of the following. 

Proposition 3.13 ([38]). For any order uj on the states of A, — h[Btj(.4,)]. 

At this point, let us note that in the inductive definition of Juj{A) it is the greatest 
element uJ of Q that is considered whereas in the construction of the expression (.4) it 
is the smallest element of Q suppressed first. 

In the course of the computation of B^ {A) by the state elimination method we con¬ 
sider automata whose transitions are labelled not by letters only but by rational expres¬ 
sions in general. In order to define the index of such generalised automata, we first define 
the index, written i (e), of a transition e as the star height of the label of e: 

i(e) = h[|e|] . 

The index of a generalised automaton is then defined inductively, by formulas that take 
into account the index of every transition: 

• if ^ is empty, then 

a^(.4)=0; (3.24) 

• if .4 is not itself a ball, then 

'^uiiA) = max ({i(e) I e does not belong to a ball in .4} 

U {3a>{V) \ V haWin A}) ; 

• if ^ is a ball, then 

3(.j(.4) = 1 + max ({i(e) I e is adjacent to w} ,T(^(.4 \ w)) . (3.26) 

It is obvious that equations (3.24)-(3.26) reduce to (3.22)-(3.2I) in the case of a ‘clas¬ 
sic’ automaton whose transitions are all labelled with letters, that is, have index 0. Figure 9 
shows two generalised automata and their index. 



ab*a 


{b{ba)*a)* 



ab*a 




(a) a„4Bi) = 2 ;a„ 2 ( 2 ?i) = 3 


(b) a„i(B2) 3 u }2 (%) = 3 


Figure 9. Computation of the index of two automata for the two possible orders on 
the states: coi = p < q, UJ 2 = q < P- 
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Proof of Proposition 3.13. We proceed by induction on the number of states of A. The 
state-elimination method consists in the first place of transforming A into an automaton B 
by adding two states to A, an initial state and a final state, and transitions which are all 
labelled by the empty word, the index of B being equal to that of A. By convention, the 
added states are greater than all the other states of A in the order oj and are never removed 
by the state-elimination method. On the other hand, the transition labels, including those 
of the new transitions, may be modified in the course of the state-elimination method. 

The base case of the induction is therefore a generalised automaton with 3 states of 
the form of Figure 10 (a) or (b). 



Figure 10. Base case of the induction 


In case (a), B contains no balls and we have 

3^{B) = max(h[E], h[F], h[H]) = h[E + F • H] = h[B^(f?)] . (3.27) 

In case (b), the unique state of B that is neither initial nor final is a ball whose index 
is 1 4- h[G], and we have 

a^(S) = max(h[E], h[F], h[H], (1 + h[G])) = h[E + F • G* • H] = h[B^(S)] . (3.28) 

Note that this reasoning is the essential part of the induction step and that for rigour, if not 
for clarity, we could have taken the automaton of Figure 10(c) as our base case, for which 
the statement is even more easily verified (and which corresponds to an automaton A with 
no state). 

Now let B be an automaton of the prescribed form with n 4- 2 states, q the smallest 
state in the order w and B' the automaton which results from the first step of the state- 
elimination method applied to B (which consists of the elimination of q). Because the 
same states (other than q) are adjacent in B as in B', and since q is the smallest element 
in the order w, the algorithm for computing the index runs the same way in B and B', 
that is, the succession of balls constructed in each automaton is identical, excluding the 
examination of q in B. It remains to show that the values calculated are also identical. 

Let V be the smallest ball in B that strictly contains g - if no such ball exists, take V — 
B', and let V' the ‘image’ of V in B'. There are two possible cases: either (a) q is not the 
source (and the destination) of a loop, or (b) it is, that is, q is a ball in B all by itself, and 
the label of this loop is an expression G. 

The transitions of V' are either identical to those of V, or, in case (a), labelled by 
products F • H, where F and H are labels of transitions of V, or, in case (b), labelled by 
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products F • G* • H. It therefore follows that, in case (a) 

'3uj{'P') = max(max {i(e) | e does not belong to a ball in V'} , 

max {2(^(2) I Q ball in "P'}) 

= max(max {i(e) | e does not belong to a ball in V} , 

max {2(^(2) I 2 ball in P}) 

= a^(P) . (3.29) 

In case (b), since = 1 + h[G], we have: 

= max( max {i (e) | e does not belong to a ball in V'} , 

max{2(,;(2) I 2 ball in P'} ) 

= max( max {i (e) | e does not belong to a ball in P} , (1 + h[G]), 

max{2(,;(2) I 2 ball in P'} ) 

= max( max {i (e) | e does not belong to a ball in P} , 3tj({9}), 

max {2(^(2) I 2 ball in P, different from {g}} ) 

= 3^(P) . (3.29)' 

If P = (and P' = B '), the equalities (3.29) and (3.29)' become 

(3.30) 

which proves the induction and hence the proposition. If not, and without starting an 
induction on the number of overlapping balls that contain {g}, we can get from (3.29) 
to (3.30) by noting that the transitions of B' are either identical to those of B, or correspond 
to transitions that are adjacent to g. 

In case (a), the labels of these transitions (those corresponding to transitions adjacent 
to g) are products of transitions of B: their index is obtained by taking a maximum, 
and (3.30) is the result of the identity max(a, b, c) = max(o, max(5, c)). 

In case (b), the labels of the same transitions are, as before, of the form F • G* • H , with 
index max(h[F], h[H], 1 + h[G]). The corresponding transition in B has label F (or FI); 
it is inspected in the algorithm for computing the index when the indices of the transition 
with label H (or F) and that of the ball {g}, with index 1 + h[G], have already been taken 
into account. The result, which is (3.30), follows for the same reason as above. □ 


Theorem 3.11 admits a kind of converse stated in the following proposition whose 
proof is postponed to the next section where we build automata from expressions. 


Proposition 3.14. With every rational expression E is associated an automaton which 
accepts |E| and whose loop complexity is equal to the star height of E. 
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4 From expressions to automata: the A-maps 

The transformation of rational expressions into finite automata establishes Proposition 2.4. 
It is even more interesting than the transformation in the other way, both from a theoret¬ 
ical point of view and for practical purposes, as there are many questions that cannot be 
answered directly on expressions but require first their transformation into automata. 

Every expression might be mapped to several automata, each of them being computed 
in different ways. We distinguish the objects themselves, that is, the computed automata, 
which we try to characterise as intrinsically as possible, from the algorithms that allow 
to compute them. We present two such automata: the Glushkov, or position, automaton 
and that we rather call the standard automaton of the expression, and the derived-term 
automaton, that was first defined by Antimirov. 

The standard automaton may be defined for expressions over any monoid whereas the 
derived-term automaton will be defined for expressions over a free monoid only. In this 
section however, we restrict ourselves to expressions over a free monoid. We begin with 
the presentation of two techniques for transforming an automaton into another one, that 
will help us in comparing the various automata associated with a given expression. 


4.1 Preparation: closure and quotient 

Closure Automata have been defined (Section 2.2) as graphs labelled by letters of an 
alphabet. It is known that the family of languages accepted by finite automata is not 
enlarged if transitions labelled by the empty word — called spontaneous transitions — 
are allowed as well. The backward closure of such an automaton A= {Q,A,E,I,T) is 
the equivalent automaton B = {Q,A,F,I,U) with no spontaneous transitions defined 
by 

E = |(p, a,r) I 3(7 € Q p q , {q,a,r) € and 17 = |p | 3(7 € T p q^ . 

It is effectively computable, as the determination of F and U amounts to computing the 
transitive closure of a finite directed graph. 


Morphisms and quotient Automata are structures; a morphism is a map from an au¬ 
tomaton into another one which is compatible with this structure. 

Definition 4.1. Let .4 = {Q,A,F,I,T) and A' = {Q',A,F',I',T') be two au¬ 
tomata. A map ip: Q ^ Q' is a morphism (of automata) if: 

(i) ip{i) c r, 

(ii) ip{T)CT', 

(iii) V(p, a,q) eF {ip{p), a, p{q)) G F'. 

The automaton A! is a quotient of A if, moreover, 

(iv) p{Q) = Q', that is, p is surjective, 

(V) p{I) = I', 

(Vi) p-\r) = T, 

(vii) \/{r,a,s) £ F', Wp G p~^{r), 3q G p~^{s) {p,a,q) G F. 
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If is a morphism, we write Lp-. A', and the inclusion |,4| C \A'\ holds. If A' is 
a quotient of A, then |,4| = \A!\ holds. 

Definition 4.1 generalises the classical notion of quotient of complete deterministic 
automata to arbitrary automata. Every automaton A admits a minimal quotient, which is 
a quotient of every quotient of A. In contrast with the case of deterministic automata, the 
minimal quotient of A is canonically associated with A, not with the language accepted 
by A. 


4.2 The standard automaton of an expression 

The first automaton we associate with an expression E, which we write iSe and which 
plays a central role in our presentation, has first been defined by Glushkov (in [29]). For 
the same purpose, McNaughton and Yamada computed the determinisation of Se in their 
paper [43] that we already quoted. In order to give an intrinsic description of 5 e, we 
define a restricted class of automata, and then show that rational operations on sets can be 
lifted on the automata of that class. 


4.2.1 Operations on standard automata An automaton is standard if it has only one 
initial state, which is the end of no transition. Figure 11 shows a standard automaton, both 
as a sketch, and under the matrix form. The definition does not forbid the initial state i 
from also being final and the scalar c, equal to 0 or 1, is the constant term of |^|. 



Figure 11. A standard automaton 


Every automaton is equivalent to a standard one. More important for our purpose, 
their special form allows to define operations on standard automata that are parallel to 
the rational operations. Fet A (as in Figure 11) and B (with obvious notation) be two 
standard automata; the following standard automata are defined; 


A + B = 


AB = 


1 0 


])’ 


0 

J 

K 

0 

F 

0 

0 

0 

G 


1 0 




0 

J 

cK 

0 

F 

U-K 

0 

0 

G 


f c + d\ 


V 

u 

V 

/ 

> 

/ 

cd 

\ 


- 





Ud 


> 

V 

V 

/ 

/ 


(4.1) 


(4.2) 
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j ^ , (4.3) 

where H = U ■ J + F . The use of the constants c and d allows a uniform treatment of 
the cases whether the initial states of A and B are final or not. 

These constructions are shown at Figure 12. 


/ 


0 

J 

\ 

1 

.4* = { 

(im). 

0 

H 

/ 

U 



Figure 12. Operations on standard automata 

Straightforward computations show that |(.4 + ;B)| = |.4| + |,B|, |(.4 • ,B)| = |.4| • |;B| and 

With every rational expression E and by induction on its depth, we thus canonically 
associate a standard automaton, which we write Se and which we call the standard au¬ 
tomaton of E. The induction and the computations show that the map E i-> iSe is a 
A-map: 

Proposition 4.1. If E is a rational expression over A*, then N = |e|. 

The inductive construction of iSe also implies: 

Property 4.2. If E is a rational expression, the dimension of <Se is f(E) + 1. 

Example 4.1. Figure 13 shows 5 ei, where Ei = {a*b -\- bb*a*)* . 

The example of E = ^((a* + 6*)* + c*)* + ... shows that the direct compu¬ 

tation of (Se by (4.1)-(4.3) leads to an algorithm whose complexity is cubic in f(E). The 
quest for a better algorithm leads to a construction that is interesting per se. 

4.2.2 The star-normal form of an expression The star-normal form of an expression 
has been defined by Briiggemann-Klein (in [11]) in order to design a quadratic algorithm 
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for the computation of the standard automaton of an expression. The interest of this notion 
certainly goes beyond that complexity improvement. 

Definition 4.2 ([11]). A rational expression E is in star-normal form (SNF) if and only if 
for any F such that F* is a subexpression of E, c(F) = 0 


Two operators on expressions, written • and □, are defined by a mutual recursion on 
the depth of the expression that defines and allows to compute the star-normal form of 
the expression. 


0° 

= 0, 1° = 

0, Va e A 

□ 

a - 

= a , 


(4.4) 


(F + G)° = 

□ 

+ 

□ 

LL 




(4.5) 


(F.G)° = 

□ 

(JD • 

□ • 

LL LL 

if c(F) = 
otherwise 

C(G) = 1 , 

(4.6) 


(F*)° = 

F° . 




(4.7) 

0* 

= 0, !• = 

1, Va e A 

a* - 

= a , 


(4.8) 


(F + G)* = 

• 

+ 

• 

LL 




(4.9) 


(F-G)* = 

F* • G* , 




(4.10) 


(F*)* = 

(F°)* . 




(4.11) 

Example 4.2. Let 

E 2 = (a*b*) 

* . Then 





E2* = ((a*b*y 

r=((«*)“ 

+(bry = 

((«)“ 

+ib)r 

= (a + b)*. 



Theorem 4.3 ([11]). For any expression E, E* is in star-normal form and iSe* = 45e • 

Theorem 4.3 implies in particular that E* is equivalent to E. It relies on three com¬ 
putations on Boolean standard automata which are the direct consequence of the formu¬ 
las (4.1)-(4.3). and of the idempotency identity I. 

Property 4.4. Let A, A', B and B' be four (Boolean) standard automata. 

(1) If .4* = A'* and B* = B'* then (.4 + B)* = (.4' -F B')* . 

(2) If c(|.4|) = c(|SD = 1, then (.4 + B)* = (A-B)* . 

(3) .4* = (.4*)* . 

^^The definition, as well as the construction, have been slightly modified from the original, for simplification. 
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Proof. (i) Let 

^={^(1 0 ), 

The hypothesis implies 


^(1 o)>(13 f)’(l)) -^' = ((1 o)>(! 


0 J' 

0 F'j'XU' 


»)■(? c./+f)'(i)) = -^'‘ = ((‘ i/'.x'+F')■((!•)) 

hence J = J', U = U' and U ■ J + F = U' ■ J' + F'. Accordingly, if we have 


(1 0 ), 


0 K 
0 Gj'W 


and B' ={{1 O) 


0 K'' 

0 G"’ 


V 


then K = K', V = V and V ■ K + G = V ■ K' + G' hold. From (4.1) and (4.3) 
follow 

{A + B)*=[{1 0 0), (o U-J + F U-K 

\ \0 V-K V-K + G \vl / 


= ( (1 0 0), 0 U' -J' + F' 


K' \ 
U'-K' 

0 V'-K' V'-K' + G'j 


^1) 

U' 

kV) 


= [jf + B'y 


(ii) With the same notation as before we have on one hand-side 


,4 + 6= ^ (1 0 0), 1^0 F o^j, y 

/ /OJA 

(,4 + 6)* = ( (1 0 0), 0 U-J + F U-K 


and then 


tO 


V-K V-K + G 


and on the other 


/OJA 


A-B= I {1 0 0), 0 F U-K , U 


U 

^Vl 


and then 


too G 


yV 


A 


{A-B)* = {{1 0 0), 0 U-J + F U-K + U-K , U 


to y ■ A 


y • A-hG 


tyy 


and the equality follows from the idempotency identity I. 
(iii) Again with the same notation, we have: 

■4 = ^ (1 0)’(o F-J + a)’([/)) ^ ^ 


0 J 

0 U - J + U - J + Fl’ 


and the equality follows from the idempotency identity I. 


□ 


Proof of Theorem 4.3. We establish by a simultaneous induction that the following fwo 
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statements hold: 

E* isinSNF and 5 e. = 5e (4.12) 

E° isinSNF and = Se- ■ (4.13) 

Both (4.12) and (4.13) clearly hold for the base clauses. 

• E = F + G 

- E* = F* + G* is in star-normal form by induction. Moreover, 

5e* = -F Sg’ — Se + Sg by induction 

= Se since E = F -F G . 

- E° = F° -F G° is in star-normal form by induction. Moreover, 

+ •Sg°)* = ('5f + by induction and by Property 4.4 (i) 

= Se* since E = F -F G . 

• E = F-G 

- E* = F* • G* is in star-normal form by induction. Moreover, 

<Se» = iSp* • iSg* = «Sf • <Sg by induction 

= <Se since E = F • G . 

- E° = F°-FG° or E° = F*-G* is in star-normal form by induction. Moreover: 

(i) if c(F) = c(G) = 1, then 

(iSpn)* = + <5 g°)* = (^F + ^^g)* by induction and by Property 4.4 (i) 

= (iSp • 5 g)* by Property 4.4 (ii) 

= Se* since E = F • G . 

(ii) if c(F)c(G) = 0 , then 

(iSpo)* = (iSp* • 5 g*)* = {Se ■ Sg)* by induction 

= Se* since E = F • G . 

• E = F* 

- E* = (F°)* is in star-normal form by induction and since c(F°) = 0 . More¬ 
over, 

iSp* = (<5 f°)* = >5f* by induction and (4.13) 

= 5e since E = F* . 

- E° = (F*)° = F° is in star-normal form by induction. Moreover 

(iSpn)* = (vSpn)* = iSp* by induction 

= iSp* since E = F and by Property 4.4 (iii) . 

□ 

As the computation of E* is linear in f (E), the goal is achieved by the following: 
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Theorem 4.5 ([11]). Let E be a rational expression in star-normal form. Then, the in¬ 
ductive computation of by (4.1 )-(4.3) has a quadratic complexity in f(E). 


Proof. 


to be done 


□ 


4.2.3 The Thompson automaton A survey on A-maps cannot miss out the method due 
to Thompson [58]. It was designed to be directly implementable as a program, primarily 
for searching with rational expressions in text. It is based on the use of spontaneous transi¬ 
tions. Figure 14 shows the basic steps of the construction, which, by induction, associates 
with an expression E a unique (and well-defined) automaton Ti- It is remarkable that this 
construction corresponds indeed to another way of defining the standard automaton. 

Proposition 4.6. The backward closure of 7 e is equal to <Se. 



(a) base cases 


(b) product 




Figure 14. Thompson’s construction 


Figure 15 shows the construction applied to the expression E 2 = {a*b -\- bb*a)* 


.xy- 

■<kz 

yrO-'- 


•0 


*0- 


- 0 ^ 

Xq- 


.0 


-Q:. <5; . -p . iD- 

b 


Figure 15. The automaton 7 e2 


4.2.4 Loop complexity of standard automata We end this section with the proof of 
Proposition 3.14: With every rational expression E, we can associate an automaton equiv¬ 
alent to E and whose loop complexity is equal to the star height of E. 
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The sketches at Figure 12 (a) and (b) make clear that if A and A' are standard au¬ 
tomata, it holds 

lc((,4. + ,4.')) = lc((^ • ,4.')) = max{lc(,4), lc(4l')} ■ 

It follows then from Definition 34 that the loop complexity of the sum and product of 
standard automata is equal to the star height of the expressions for the sum and product, 
provided equality hold for the operands. 

The same relation does not holds for the star of a standard automaton, as seen on the 
example shown at Figure 16: the loop complexity of the automaton is not necessarily 
incremented by the star operation. In the opposite way, the star operation (on standard au¬ 
tomata) may well increase the loop complexity by more than 1, as shown by . Hence, 
it is not true that Ic(iSe) = h[E] holds. 

a T a 

Figure 16. The standard automaton .43 of a (a^)* and A^ 

In order to circumvent this difficulty, we replace the star operation on standard au¬ 
tomata by a more elaborate one. A standard automaton is normalised if it has only one 
final state and if this final state is not the origin of any transition. An obvious construc¬ 
tion transforms any standard automaton A into an equivalent normalised one, which we 
write 4lnor^ and we have: 

Property 4.7. lc(Aor) = lc(4l). 

We further write A^ for the standard automaton A in which the initial state is not final. 
The automaton (^Inor*^ • -4°) has a cut-vertex t. Finally, let B = (4lnor° ■ -4°) and B' 
the automaton in which t has been made final. Clearly, \B\ = A* and Proposition 3.14 is 
established with the proof of the following lemma. 

Lemma 4.8. \c{B') = lc(.4) -T 1. 

Proof. The automaton B' without its initial state z is a ball; by definition, we have 

\c{B') = 1 + min {\c{B' \ {z, s})\seB'\i} . (4.14) 

The final state t of 4lnor is a cut vertex (of the underlying graph) of Anor^ • A^ If we 
set s = f in (4.14), the balls of B” = B' \ {i, s} are those of ^Inor*^ and A^ and hence 
\c{B') ^ lc(4l) -f 1. If we take an arbitrary s in ^Inor*^ or A^, B" contains either A} 
or4lnor° and lc(B') ^ lc(4l) + 1. □ 


4.3 The derived-term automaton of an expression 

Let us first recall the (left) quotient operation on languages: 

VL e ip (A*) ,yu€A* u-^L = {v€A*\uvGL} . 
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The quotient is a (right) action of A* on Cp (2l*): 

yL€^{A*),Vu,veA* {uv)-^L = v-^u-^L) . (4.15) 

A fundamental, and characteristic, property of rational languages — which is another 
way to express that they are recognisable — is that they have a finite number of quotients. 

The principle of the construction we present in this section, and which we call deriva¬ 
tion, is to transfer the quotient on languages to an operation on the expressions. First 
introduced by Brzozowski [12], the definition of the derivation of an expression E has 
been modified by Antimirov [4] (cf. Notes) and yields a non-deterministic automaton Ae, 
which we propose to call the derived-term automaton of E. This construction concerns 
thus expressions over/ree monoids only. In the sequel, E is a rational expression over A*. 


Definition 4.3 (Brzozowski-Antimirov [4]). The derivation of E with respect to a letter a 
of A, denoted by ^ E, is a set of rational expressions over A*, inductively defined by: 


da 

aa ° “ da^ ~ 


d 

yb€ A T-b = 
oa 


{ 1 } 


if b = a , 
otherwise, 


(F + G)= ^FU-G, 


da 


da 

d 


da 


da' 


7T:(F-G) = h-F .GUc(F)-G 


da 


(F*) = 


da 

_a 

da 


da 


(4.16) 

(4.17) 

(4.18) 

(4.19) 


Equation (4.18) should be understood with the convention that the product xX of a 
set AT by a Boolean value x is X \f x = 1 and 0 if a; = 0. The induction involved in 
Equations (4.17)-(4.19) should be interpreted by extending derivation additively (as are 
always derivation operators) and by distributing (on the right) the • operator over sets 
as well. Einally, every operation on rational expressions is computed modulo the trivial 
identities T, but not modulo the natural identies N — nor the idempotent identites I 
and J. 


Definition 4.4. The derivation of E with respect to a non-empty word r; of A*, denoted 
by ^ E , is the set of rational expressions over A*, defined by (4.17)-(4.19) for letters 
in A and by induction on the length of v by: 


Vm S A"*", Vo € A 


dua 


A 

da 



(4.20) 
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The derivation of expressions is indeed parallel to the quotient of languages as we 
have the following property. 

Property 4.9. 

VL, iT C Vo G A a-^{L[JK) = a-^L (J a-^K , 

a~^{LK) = {a~^L) U c{L)a~^K , 
a-i(L*) = {a-^L)L* . 

It follows, by induction on the depth of the expression: 

VE G RatE A* , Va G A 1 ^ e| = a"^|E| (4.21) 

'da ' 

which in turn implies, by induction on the length of words and the parallel between (4.20) 
and (4.15): 

VE G RatE A*,\/u€ A+ e| = m-^IEI . (4.22) 

'ou ' 

In particular, we have: 

Property 4.10. If E is not a constant, there exists u in such that c(^ E) = 1. 
Example 4.3. The derivation of Ei = {a*b + bb*a)* (cf. Example 4.1) yields: 

^a*5Ei = {Ei}, ^(6*aEi)* ={Ei}, ^(&*aEi)* = {&*aEi} . 

4.3.1 The derived-term automaton Derivation thus associates a pair of an expression 
and a word with a set of expressions. We now turn this map into an automaton. 

Definition 4.5. We call true derived term of E every expression that belongs to ^ E for 
some word w of Gl+; we write TD (E) for the set of true derived terms of E: 

TD(E) = U„g^+^E . (4.23) 

The set D (E) = TD (E) U {E} is the set of derived terms of E. 

Example 4.4 (Example4.3 cont.). D(Ei) = {Ei,a*&Ei,6*aEi} . 

The sets of derived terms and the rational operations are related by the following 
equations, from which most of the subsequent properties will be derived. 

Proposition 4.11. Let F and G be two expressions. Then, TD (F + G) = TD (F) U 
TD (G), TD (F • G) = (TD (F)) • G U TD (G), and TD (F*) = (TD (F)) • F* hold. 

Starting from TD (0) = TD (1) = 0 and TD (a) = {1} for every a in A, TD (E) 
can be computed from Proposition 4.11 by induction on d(E) and without reference to 
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the derivation operation (cf the prebases in [44] and Definition 6.2 below). It follows in 
particular that Card (TD (E)) ^ f (E) and thus: 

Corollary 4.12. Card (D (E)) < f (E) + 1. 

The computation of D (E) is a A-map, as expressed by the following. 

Definition 4.6 (Antimirov [4]). The derived-term automaton of E is the automaton Ae 
whose set of states is D (E) and whose transitions are defined by: 

(i) if K and K' are derived terms of E and a a letter of A, then (K, a, K') is a transition 
if and only if K' belongs to ^ K; 

(ii) the initial state is E; 

(iii) a derived term K is final if and only if c(K) = 1. 

Theorem 4.13 ([4]). For any rational expression E, |e| = |^e|. 

Example 4.5 (Example 4.4 cont.). The automaton Aei is shown at Figure 18. 





..0 




Figure 18. The automaton Aei ■ 


4.3.2 Relationship with the standard automaton The constructions of the standard 
and derived-term automata of an expression are of different nature. But both arise from the 
same inner structure of the expression by two inductive processes, and the two automata 
have a structural likeness which yields another proof of Corollary 4.12: 

Theorem 4.14 ([17]). For any rational expression E, Ae is a quotient of Se- 

4.3.3 Derivation and bracketing The derivation operator is sensitive to the bracketing 
of expressions; on the other hand, it does commute to the associativity identity A. 

Example 4.6. Let ab{c{ab))* be an expression which is not completely bracketed. The 
derivation of the two expressions obtained by different bracketings yields: 

D (a {b (c(ab))*)j = {a(b(c(ab))*), b (c(ab))*, (c(ab))* , (ab) (c (a b))*} . 

D ((ab) (c(ab))*j = {(ab) (c(ab))* , b(c(ab))*, (c(ab))*} . 

More precisely, we have the following. 

Proposition 4.15 ([3]). Let E, F and G be three rational expressions. Then: 

Card (D ((E • F) • G)) ^ Card (D (E ■ (F ■ G))) and A \— D ((E • F) • G) = D (E • (F • G)) . 
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5 Changing the monoid 

Most of what has been presented so far extends without problems from languages to 
subsets of arbitrary monoids, from expressions over a free monoid to expressions over 
such monoids. We run over definitions and statements to transform them accordingly. 
The main difference will be that rational and recognisable sets do not coincide anymore, 
making the link between finite automata and rational expressions even tighter, and ruling 
out quotient and derivation that refer to the recognisable ‘side’ of rational languages. 

Non-free monoids of interest in the field of computer science and automata theory 
are, among others, direct products of free monoids (for relations between words), free 
commutative monoids (for counting purpose), partially commutative, or trace, monoids 
(for modelling concurrent or parallel computations), free groups and polycyclic monoids 
(in relation with pushdown automata). 

In the sequel, M is a monoid, and 1m its identity element. 


5.1 Rationality 

Rational sets and expressions Product and star are defined in (M) as in ^ (A*) and 
the set of rational subsets of M, denoted by RatM, is the smallest subset of (M) 
which contains the finite sets (including the empty set) and which is closed under union, 
product, and star. 

Rational expressions over M are defined as those over A*, with the only difference 
that the atoms are the elements of M; their set is denoted by RatE M. We also write |E| 
for the subset denoted by an expression E. Two expressions are equivalent if they denote 
the same subset and we have the same statement as Proposition 2.1: 

Proposition 5.1. A subset of M is rational if and only if it is denoted by a rational 
expression over M. 

A subset G of M is a generating set if M = G* . The direct part of Proposition 5.1 
may be restated with more precision as: any rational subset of M is denoted by a rational 
expression whose atoms are taken in any generating set. It follows from the converse part 
that a rational subset of M is contained in a finitely generated submonoid. 


Finite automata An automaton over M, denoted hy A = {Q^ M, i?, /, T), is defined 
like an automaton over A*, with the only difference that the transitions are labelled by 
elements ofM: E C Qx M xQ. Then, A is finite if E is finite. 

The subset accepted by A, called the behaviour of A and denoted by |yl| as above, is 


the set oflAiels of successful computations: |.4| = G M 


3i € I,3t GT 


'}■ 


The fundamental theorem of finite automata In this setting, the statement appears 
more clearly different from Kleene’s theorem. Its first appearance" seems to be in Elgot 
and Mezei’s paper on rational relations. 

* 'Hidden in a footnote! 
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Theorem 5.2 ([23]). A subset of a monoid M is rational if and only if it is the behaviour 
of a finite automaton over M whose labels are taken in any generating set of M. 

There is not much to change in Propositions 2.3 and 2.4 to establish Theorem 5.2. 

Proposition 5.3 (T-maps). For every finite automaton A over M, there exist rational 
expressions over M which denote |aI|. 

All four methods described in Section 3 apply for arbitrary M, even if their formal 
proof may be slightly different (Arden’s lemma does not hold anymore). 

Proposition 5.4 (A-maps). For every rational expression E over M, there exist finite 
automata over M whose behaviour is equal to |E|. 

Here again, the algorithms and results described in Section 4.2 for the construction of 
the standard automaton, Thompson automaton, etc. pass over to expressions over M. On 
the contrary, quotients in M define recognisable subsets of M and not rational ones (see 
below) and derivation of expressions over M does not make sense anymore. 


5.2 Recognisability 

Definition 2.3 may be rephrased verbatim for arbitrary monoids. A subset P of M is said 
to be recognised by a morphism a: M N if P = a~^{a{P)). A subset of M is 
recognisable if it is recognised by a morphism from M into a finite monoid. The set of 
recognisable subsets of M is denoted by Rec M. 

Recognisable and rational subsets We can then reproduce almost verbatim the con¬ 
verse part of the proof of Theorem 2.5. Let P be in Rec M, recognised by a morphism a. 
We replace the alphabet A by any generating set G of M in the construction of the automa¬ 
ton Aa. If M is finitely generated, G is finite, so is Aa and P is rational by Theorem 5.2: 

Proposition 5.5 (McKnight [42]). If M is finitely generated, then RecM C RatM. 

On the other hand, the first part of the quoted proof does not generalise to non-free 
monoids and the inclusion in Proposition 5.5 is strict in general. For instance, the set 
(a,c)* = ((a, l)(l,c))* is a rational subset of a* x c* (where the product is formed 
component wise). It is accepted by a two-state automaton which induces a map p, from 
the generating set of a* x c* into 

= J) and ^((l,c)) = (° U) . 

But this map does not define a morphism from a* X c* into B^^^. 

Decision problems for rational sets In general. Rat M is not a Boolean algebra. This 
is also accompanied with undecidability results. The undecidability of Post Correspon¬ 
dence Problem, easily expressed in terms of monoid morphisms, implies for instance: 
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Theorem 5.6 (Rabin-Scott [48]). It is undecidable whether the intersection of two ratio¬ 
nal sets of {a, b}* x {c, d}* is empty or not. 

From which one deduce: 

Theorem 5.7 (Fischer-Rosenberg [24]). The equivalence of finite automata, and hence 
of rational expressions, over {a, b}* x {c, d}* is undecidable. 

In contrast, the cases where Rat M is an effective Boolean algebra— such as when M 
is a (finitely generated)/ree commutative monoid [28] or free group [26] — play a key 
role in model-checking issues which involve counters, or pushdown automata. 


6 Introducing weights 

Most of the statements about automata and expressions established in the previous sec¬ 
tions extend again without much difficulties in the weighted case, as we have taken care 
to formulate them adequately. There are two questions though that should be settled first 
in order to set up the framework of this generalisation. First, the definition of the star 
operator requires some mathematical apparatus to be meaningful. Second, the definition 
of weighted expressions has to be tuned in such a way that former computations such as 
the derivation remain valid. 


6.1 Weighted languages, automata, and expressions 

6.1.1 The series semiring The weights, with which we enrich the languages or subsets 
of monoids are taken in a semiring, so as to give the set of series we build the desired struc¬ 
ture. We are interested in weights as they actually appear in the modelisation of phenom¬ 
ena that we want to be able to describe (and not because they fullfil some axioms). These 
are the classical numerical semirings N, Z, Q, etc., the less classical (Z U +cx), min, -f ), 
etc. None of them are Conway semirings (cf. Chapter 20), N is a quasi-Conway semir¬ 
ing but not the others. In the sequel, K is a semiring. The unweighted case corresponds 
to K = B and will be refer to as the Boolean case. 

As in the Boolean case, free monoids give rise to results which do not hold in non-free 
ones (the Kleene-Schiitzenberger theorem). But not all non-free monoids allow to easily 
define series with weights in arbitrary semirings. We restrict ourselves to graded monoids, 
that is, which are equipped with a length function. They behave exactly like the free 
monoids as far as the construction of series is concerned, they cover many monoids that 

*^The definition of the behaviour of weighted automata also conceals a problem due to the existence of 
spontaneous or e-transitions). This is out of the scope of this chapter where we focus on the relationships 
between automata and expressions. All usual definitions eventually allow to establish that every automaton 
whose behaviour is defined is equivalent to a proper automaton. This is how we define a weighted automaton 
and where we begin our presentation. We thus save a significant amount of foundation results. On this subject, 
we refer to other chapters of this handbook (Chapters 4 and 20) and other works ([57, 9, 37, 54, 20, 41]). 


Long version 


February 13, 2015 


38 


J. Sakarovitch 


are considered in computer science, and they are sufficient to make clear the difference 
between the free and non-free cases as far as rationality is concerned. In the sequel, M is 
a finitely generated graded monoid. 

Series Any map s from M to K is formal power series (series for short) over M with 
coefficients in K. The image by s of an element m in M is written <s, m> and is called 
the coefficient ofm in s. The set of these series, written K((M)), is equipped with the (left 
and right) ‘exterior’ multiplications, the pointwise addition, and the ( Cauchy) product: 
for every m in M, <st, m> = uXt, v> . As M is graded, the product is 

well-defined, and the three operations make K{{M)) a semiring (cf. Chapter 4). 

The support of a series s is the subset of elements of M whose coefficient in s is 
not Or. a series with finite support is a polynomial', the set of polynomials over M with 
coefficients in K is written ]K(M). 

Topology The following definition of the star as an infinite sum calls for the definition 
of a topology on ]K((M)). The semirings K we consider are equipped with a topology 
defined by a distance, whether it is a discrete topology (N, Z, (Z U +oo, min, + ), etc.) 
or a more classical one (Q, R, another L((A^)), etc.). Since M is graded (and finitely 
generated) it is easy to derive a distance which defines on K((M)) the simple convergence 
topology: 

Sn converges to s if, and only if, for all m in M, <s„, m> converges to <s, m> . 

Along the same line, a family of series {si}ig/ is summable if for every m in M the 
family is summable (in K). An obvious case of summability is when for 

every m in M there is only a finite number of indices i such that Ksi,m'> is different 
from Ok, in which case the family is said to be locally finite. 

All quoted semirings that we consider are topological semirings, that is, not only 
equipped with a topology, but their semiring operations are continuous. We also use 
silently in the sequel the following identification: if Q is ^ finite set, the 

semiring of Q x Q-matrices with entries in K((M)) is isomorphic to the 

semiring of series on M with coefficients in 

Star The star, denoted t*, of an element t in an arbitrary topological semiring T (not 
only in a semiring of series) is defined if the family {f"}„gN is summable and in this case, 
t* =X]nGN^” and f is said to be Storable. If f* is defined, then t* = If+tt* = lT+t*t 
hold. If moreover T is a ring, this can be written (1 — t)t* = t*{l — t) = 1 and t* is 
the inverse of 1 — t. Generally in semirings, the star of an element may be viewed as a 
substitute of taking the inverse in a poor structure that has no inverse. Hence is the name 
rational given to objects that can be computed with the star. 

The constant term of a series s is the coefficient of the identity of M: c(s) = <s, 1 m> • 
A series is proper if its constant term is zero. If s is proper, the family {s"}„gN is locally 
finite since M is graded and the star of a proper series of ]K((M)) is thus always defined. 

Lemma 6.1. Let s and t be two series in ]K((M)). If s* is defined, then s*t is the unique 
solution of the equation X = sX + f. 
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6.1.2 Rational series and expressions The rational operations on K((M)) are; the two 
exterior multiplications by elements of K, the addition, the product, and the star which is 
not defined everywhere. A subset £ ofK{{M)) is closed under star if for every s 'mS such 
that s* is defined then s* belongs to £. The rational closure of a set £, written KRat £, is 
the smallest subset of K((M)) closed under the rational operations and which contains £. 
The set of (K-)rational series, written KRat M, is the rational closure of IK(M). 

Weighted rational expressions A rational expression on M with weight in K — a 
weighted expression — is defined by completing Definition 2.1 with two operations/or 
every k in K: if E is an expression, then so are (kE) and (Efc). The set of weighted 
rational expressions is written K RatE M. As for the languages, we write |E| for the 
series denoted by E, with the supplementary equations: |(fc E)| = A:|E| and |(Efc)| = |E|A:. 

The constant term c(E) is defined as in Definition 2.2 but for the last equation 
[c(F*) = 1] which is replaced by: ‘c(F*) = c(F)* if the latter is defined". An ex¬ 
pression is valid if its constant term is defined. As M is graded, c(E) = <|E|, 1 m> holds 
for every valid weighted rational expression E. Finally, the following holds: 

Proposition 6.2. A series of ]K((M)) is rational if and only if it is denoted by a valid 
rational 'K-expression over M. 

In this framework, we reformulate Lemma 6.1 as: 

Corollary 6.3. Let U and V be two expressions in KRatEM. If (c(U))* is defined, 
then U* V denotes the unique solution of the equation X = |U|X + |V|. 

6.1.3 Weighted automata and the fundamental theorem An automaton A over M 
with weight in K, a K-automaton for short, still written A = {Q, M, E,I,T), is an 
automaton where the sets of initial and final states are replaced with maps from Q to K, 
that is, every state has an initial and a final weight, and where the set E of transitions is 
contained in Q x K x (M \ 1 m) x Q, that is, every transition is labelled with a monomial 
in K(M), different from a constant term. The automaton A is finite if E is finite. 

Alternatively, the same automaton is (more often) written A — {I,E,T), with the 
convention taken at Section 2: E is the transition matrix of A, a Q x Q-matrix whose 
(p, g)-entry is the sum of the labels of all transitions from p to q, and / and T are vectors 
in In this setting, A is finite if every entry of i? is a polynomial of K(M). 

The label of a computation in A is, as above, the product of the labels of the transitions 
that form the computation, multiplied (on the left) by the initial weight of the origin and 
(on the right) by the final weight of the end of the computation. With the definition we 
have taken for automata (no transition labelled with a constant term), and because M is 
graded, the family of labels of all transitions of A is summable and the series accepted 
by A, also called behaviour of A and written |^|, is its sum. The fundamental theorem of 
automata then reads: 

Theorem 6.4. Let M be a graded monoid. A series o/]K((M)) is rational if and only if it 
is the behaviour of a finite ^-automaton over M. 
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6.1.4 Recognisable series The distinction between rational and recognisable carries 
over from subsets of a monoid M to series over M. The equivalence between automata 
over free monoids and matrix representation (cf. Section 2.3) paves the way to the defini¬ 
tion of recognisability. 

A K-representation of M of dimension Q is a triple (A, p, v) where /r: M —> 
is a morphism, and A and v are two vectors of The representation (A, p, v) realises 
the series s = ' b'i^) • tn ; a series in ]K((M)) is recognisable if it is realised 

by a representation and the set of recognisable series is denoted by KRec M. The family 
of rational and of recognisable series are distinct in general. A proof which is very similar 
to the one given at Section 2.3, and which is independent from K, yields the following. 

Theorem 6.5 (Kleene-Schiitzenberger). If A is finite, then KRat A* = KRec A* . 


6.2 From automata to expressions: the F-maps 

With the definition taken for a K-automaton A = {I,E,T), every entry of is a 

proper polynomial of IK(M), E is in hence a proper polynomial of 

and E* is well-defined. Lemma 2.6 generalises to K-automata and \A\ = I ■ E* - T holds. 

In every respect, the weighted case is similar to the Boolean one. The direct part of 
Theorem 6.4 follows from the generalised statement of Proposition 3.1: 

Proposition 6.6. The entries of the star of a proper matrix E o/IK((M))‘5^*5 belong to 
the rational closure of the entries of E. 

The same algorithms as those presented at Section 3: the state-elimination and system- 
solution methods, the McNaughton-Yamada and recursive algorithms, establish the 
weighted version of Proposition 2.3: 

Proposition 6.7. Let M be a graded monoid. For every finite M,-automaton A over M, 
there exist rational expressions over M which denote \A\ 

If the algorithms are the same, one has to establish nevertheless their correctness in this 
new and more complex framework. We develop the case of the system-solution method, 
the other ones could be treated in the same way. To begin with, we have to enrich the 
set of trivial identities in order to set up the definition of reduced weighted expressions, 
which in turn is necessary to define computations on expressions. The set T as defined at 
Section 3.1 is now denoted as Tu: 

E+0=E, 0+E = E, E-0 = 0, 0-E = 0, E-1 = E, 1-E=E, 0* = 1 (T,) 
and augmented with three other sets of identities: 

0kE = 0, E0k = 0, fc0 = 0, 0fc = 0, 1 kE=E, E1]k=E (Tr) 
k{hE) = kh£, {£k)h = £kh, {k£)h = k{£h) (Ak) 

\k = k\, E-(fcl) = Efc, (fcl)-E = A:E (Uk) 


Long version 


of Ch. 2 of AutoMAthA Handbook 



Automata and rational expressions 


41 


From now on, all computations on weighted expressions are performed modulo the trivial 
identities T = Tu A Tjj A Ajj A Ur . Besides the trivial identities, the natural identities 
N = A A D A C hold on the expressions of K RatE M for any K and (graded) M, and, 
in contrast, the identities I and J that are special to (M) do not hold anymore. 

The system-solution method starts from a proper automaton A = {I,E,T) of di¬ 
mension Q whose behaviour is \A\ = I -V where V = E* ■ T is a vector in K{{M))'^. 
Lemma 6.1 easily generalises and as E is proper (in {{M))), V is the unique solu¬ 
tion of the equation X = EX + T which we rewrite as a system of Card (Q) equations: 

Vpeg Vp = j:g^Q\Ep,g\v,+\Tpi\ (6.1) 

where the Vp are the ‘unknowns’, where the entries Ep^g, which are linear combinations 
of elements of M, are considered as expressions and denoted as such and where \Tp 1| is 
the series reduced to the monomial Tp 1 m- The system (6.1) may be solved by successive 
elimination of the unknowns, by means of Corollary 6.3. When all unknowns Vq have 
been eliminated following an order uj on Q, the computation yields an expression that we 
denote by Etj(A), as in Section 3.3, and |A| = |Etj(A)| holds. 

The parallel with the Boolean case can be carried on: given a K-automaton A of 
dimension Q, an ordering w, and a recursive division r on Q, the expressions Baj(A), 
Mcj (A), and Cr (A) that all denote |A| are computed by the state-elimination method, the 
McNaughton-Yamada and recursive algorithms respectively. The results on the compari¬ 
son between these expressions also extend to the weighted case. 

Proposition 6.8. For every order w on Q, (A) = E^ (A) holds. 

Vrot^osiiion 6.9. For every order ijj on Q, N A U I— M(,;(Ap, 5 ) = Bi^(Ap, 5 ) holds. 

Theorem 3.5 also extends to the weighted case (and it is now clear why it was impor¬ 
tant that identities I and J do not play a role in that result). 

Theorem 6.10. Let ui and oj' be two orders on the set of states of a 'K-automaton A 
Then, NASAP I — B^(A) = B^/(A) holds. 


6.3 From expressions to automata: the A-maps 

6.3.1 The standard automaton of a weighted expression The definition of a standard 
weighted automaton is the same as the one of a standard automaton for the Boolean case: 
a unique initial state on which the initial map takes the value Ijk and which is not the end 
of any transition. Such an automaton may thus be represented as in Figure 11 and every 
weighted automaton is equivalent to, and may be turned into, a standard one. 

As in the Boolean case, operations are defined on standard weighted automata that are 
parallel to the rational weighted operators. With the notation of Figure 11, the operators 
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A-\- B and A - B sit given by (4.1) and (4.2), kA and ,4.fc by 
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and A*, which is defined when c* is defined, by the following modification of (4.3): 



(4.3’) 


where H = U-c* J+F. As in Section 4.2, these operations allow to associate with every 
weighted expression E and by induction on its depth, a standard weighted automaton Se 
which we call the standard automaton of E. Straightforward computations show that 
\ikA)\ = k\A\, \iAk)\ = \A\k , |(^ + B)\ = \A\+\B\, |(yf • B)\ = \A\-\B\ and |(yf*)| = |^|*. 
From which one concludes that the construction of Se is a A-map: 


Proposition 6.11 ([14, 39]). If E is a weighted expression over A*, then N = |e|. 


The automaton 5 e has f(E) + 1 states. Computing iSe from (4.1), (4.2) and (4.3’) 
is cubic in f(E) and a star-normal form for weighted expressions is something that does 
not seem to exist in the general case. Figure 19 shows the standard Q-automaton Se^ 
associated with E 3 = (gO* + |5*)* and Z-automaton 5 e 4 associated with E 4 = (1 — 
a) a*. 



Figure 19. The Q-automaton iSe 3 and the Z-automaton iSe 4 


It is the necessary definition ofkA and Ak that rules out the equivalence km = mk , 
with TO in M, from the set of trivial identities. 


6.3.2 The derived-term automaton of a weighted expression The (left) quotient op¬ 
eration also extends from languages to series: for every s in ]K((A*)), and every u in A*, 
u~^s is defined by <.u~^s,v> = <.s,uv'> for every v in A*. The quotient is a (right) 
action of A* onK((A*)): {uv)~^s = v~'^ 

In contrast with the Boolean case, a series in KRat A* may have an infinite number 
of distinct quotients. However, the quotient operation allows to express a characteristic 
property of rational series. Let us call stable a subset U of IK ((A*)) closed under quotient. 
Then, a characterisation due to Jacob reads: a series o/]K({A*)) is rational if and only if 
it is contained in a finitely generated stable submodule o/K((A*)), cf [9, 56]. 

Derivation The derivation of weighted rational expressions implements the lifting of 
the quotient of series to the level of expressions. It yields an effective version of the 
characterisation quoted above. 
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In the sequel, addition in K is written © to distinguish it from the + operator in ex¬ 
pressions. The set of (left) linear combinations of K-expressions with coefficients in K 
is denoted, by abuse, by K(K RatEA*). In the following, [k E] or /c E is a monomial 
in K(]K RatE A*) whereas (fc E) is an expression in K RatE A*. An external right multi¬ 
plication on K(]K RatE A*) by an expression and by a scalar is needed in the sequel. It is 
first defined on monomials by {[k E] • F) = A: (E • F) and ([/c E] k') = fc (E k') and then 
extended to K(K RatE A*) by linearity. 


Definition 6.1 ([39]). The derivation of E in K RatE A* with respect to a in A, denoted 
by ^ E, is a linear combination of expressions in IK RatE A* defined by (4.16) for the 
base cases and inductively by the following formulas. 


d d d 



d d d 








and 



c(E)* 



(E*) 


The last equation is defined only if E* is a valid expression. The derivation of an 
expression with respect to a word u is defined by induction on the length of u: for ev¬ 
ery u in A+ and every a in A, E = ^ E) and the definition of derivation is 
consistent with that of quotient of series since for every u in A+, (E)| = u“^|E| holds. 


The derived-term automaton At Section 4.3.1, we have defined the derived terms of 
a (Boolean) expression as the expressions that occur in a derivation of that expression. 
Proposition 4.11 then established properties that allow to compute these derived terms, 
without derivation. For the weighted case, we take the same properties as the definition. 

Definition 6.2 ([39]). The set TD (E) of true derived terms of E in IK RatE A* is in¬ 
ductively defined by: TD [k E) = TD (E), TD (E k) = (TD (E) k), TD (E + F) = 
TD(E)UTD(F), TD(E-F) = (TD(E)) • F U TD(F), TD (E*) = (TD (E)) • E*, 
starting from the base cases TD (0) = TD (1) = 0 , and TD (a) = {1} for every a in A. 

TD (E) is a set of unitary monomials of K(IK RatE A*), with Card (TD (E)) ^ f (E). 
The set of derived terms of E is D (E) = TD (E)U{E} . Theorem 6.12 insures consistency 
between Definitions 6.1 and 6.2; the usefulness of the latter follows from Theorem 6.13. 

Theorem 6.12 ([51, 39]). Let E be in IK RatE A* and D (E) = {Ki,..., K„}. For every a 
in A, there exist an nx n-matrix ^{a) with entries in IK such that 

^ W M “ ®i6N Kj . 

The derivation of an expression E in K RatE A* with respect to every word in A+ is 
thus a linear combination of derived terms of E. Hence the derived terms of an expres¬ 
sion denote the generators of a stable submodule that contains the series denoted by the 
expression. Theorem 6.12 yields the the derived-term automaton of E, Ae = {I,X,T), 
of dimension D (E), with / = Ik if = E and Ok otherwise, X — /x(a) a , and 

Tj = c(Kj). The IK-derivation is another A-map since |Ae| = |E| holds. 
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Morphisms and quotients of (Boolean) automata are generalised to Out-morphisms 
and quotients of K-automata (cf. [56, 5]). Theorem 4.14 is then extended to the weighted 
case. 

Theorem 6.13 ([39]). Let E be in K RatE A*. Then Ae is a quotient of Se- 

Remark 6.14. This statement is a justification for Definition 6.2. The monomials that ap¬ 
pear in the derivations of an expression E are in D (E). The converse is not necessarily true 
when K is not a positive semiring: some derived terms may never occur in a derivation, 
as it can be observed for instance on the Z-expression E 4 = (1 — a) a* {cf. Figure 20). 
With a definition of derived terms based on derivation only. Theorem 6.13 would not hold 
anymore. 



Figure 20. The Z-automaton iSe 4 and its Z-quotient Ae 4 


7 Notes 

Most of the material presented in this chapter has appeared in previous work of the author 
[54, 55, 56]. 

Section 1. New look at Kleene’s theorem A detained history of the development of 
ideas at the beginning of the theory of automata is given in [46]. Berstel [7] attributes to 
Eilenberg the idea of distinguishing the family of recognisable from that of rational sets. 

Besides the already quoted Elgot and Mezei’s paper [23], other authors have certainly 
noticed the equality of expressiveness of automata and expressions beyond free monoids. 
It is part of Walljasper’s thesis [59]; it can be found in Eilenberg’s treatise [22]. The 
splitting of Kleene’s theorem has been proposed in [53]. 

Section 3. From automata to expressions First note that this section is mostly of 
theoretical interest: for which practical purpose would one exchange an automaton for an 
expression? 

Identities. As mentioned, the axiomatisation of rational expressions, even hinting at 
bibliographic references, is out of the scope of this chapter. Conway showed that besides 
the identities S and P (that are at the basis of the definition of the so-called ‘Conway 
semirings’, cf. Chapter 20), each finite simple group gives rise to an identity that is inde¬ 
pendent from the others [18]. Krob, who showed that this set of identities is complete, 
coined S and P the aperiodic identities [35]. 
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State-elimination method. The example of Figure 3 is easily generalised so as to 
find an exponential gap between the length of expressions for two distinct orders. The 
search for short expressions is performed by heuristics; as reported in [30], the naive one, 
modified or not as in [19], appears to be good {cf. Chapter 12 for more information on the 
subject). 

McNaughton-Yamada algorithm is the implementation in the semiring of languages 
of the comtemporary Floyd-Roy-Warshall algorithms (in the Boolean or tropical semir¬ 
ings) [27, 49, 60]. 

Star height. The star height of a rational language L is the minimum of the star 
heights of the rational expressions that denote L. Whether the star height of a language 
is effectively computable has been a long standing open problem until it was positively 
solved first by K. Hashiguchi [31] and then by D. Kirsten [33]. 

Section 4. From expressions to automata The presentation of the standard automaton 
given here is not the classical one, and not only for the chosen name. The recursive defini¬ 
tion, also used in [25] for instance, avoids the definition of First, Last, and Follow 
functions that are built in most papers on the subject. Based on these functions, other au¬ 
tomata may be defined: e.g. in [43] they are used to compute directly the determinisation 
of iSe, in [32] positions with the same image by Follow are merged, giving rise to a 
possibly smaller automaton, called/ollow automaton. 

Attributing derivation to Brzozowski and Antimirov together is an unusual but sensible 
foreshortening. Original Brzozowski’s derivatives [12] are obtained by replacing ‘U’ by 
a ‘-f’ in (4.17) and (4.18). Derivatives are then expressions, and there is a finite number of 
them, modulo the A, C, and I identities. By replacing the ‘-f’ by a ‘U’ in Brzozowski’s 
definition, Antimirov [4] changed the derivatives into a set of expressions, which he called 
partial derivatives, as they are ‘parts’ of derivatives. As they are applied to union of 
sets, and not to expressions, the A, C, and I identities come for free, and are no longer 
necessary to insure the finiteness of derived terms. 

A common technique for defining A-maps has been the linearisation E of the expres¬ 
sion E, that is, making all letters in E distinct by indexing them by their position in E 
(e.g. [43, 32]). Berry-Sethi [6] showed that the (Brzozowski) derivatives of E coincide 
with the states of Se, whereas Berstel-Pin [8] observed that |E|is a local language L and 
interpreted Berry-Sethi’s result as the construction of the deterministic automaton canon¬ 
ically associated with L. 

The similarity between Mirkin’s prebases [44] and Antimirov’s derived terms was 
noted by Champarnaud-Ziadi [16], who called equation automaton the derived-term au¬ 
tomaton. 

Allauzen-Mohri have generalised Proposition 4.6 and Theorem 4.14 and computed Ae 
and the follow automaton of E from 7e by quotient and elimination of marked sponta¬ 
neous transitions [2]. 

In [15], an algorithm is given which is a kind of converse of a A-map: it recognises if 
an automaton is the standard automaton iSe of an expression E and, in this case, computes 
such an E in star-normal form. The problem of inverting a F-map has been given a partial 
answer in [40]: it is possible to compute A from B(^(A) for certain A (and any w); this 
has lead to the definition of a variant of the derivation: the broken derivation, that has 
been further studied in [3]. 
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Section 5. Changing the monoid Proposition 5.5 leads naturally to consider monoids M 
in which Rat M = Rec M holds, and which one could call Kleene monoids. In [52] was 
defined the family of rational monoids which contains all previously known examples of 
Kleene monoids; still the inclusion is strict [45]. Commutative Kleene monoids, as well 
as finitely generated submonoids of Rat a* are rational monoids [50, 1]. 

Section 6. Introducing weights If the definition of rational (and algebraic) series in 
non-commuting variables as generalisation of rational (and context-free) languages on 
one hand-side, as well as the formalisation of rational expressions on the other, date back 
to the beginning of automata theory, the formalisation of weighted rational expressions 
seem to have appeared in various papers in the years 2000 only [14, 51, 39]. A satisfactory 
definition of trivial identities for weighted expressions proves to be tricky and has evolved 
in the publications of the author. 

By replacing quotient and derivation by co-induction, Rutten formulated the equiva¬ 
lent of Theorem 6.12 [51]. 

Krob [36] and Berstel-Reutenauer [10] have considered ‘weighted rational expres¬ 
sion’ slightly different from those expression dealt with in this chapter. With their dif¬ 
ferentiation and derivation, they have tackled different problems than the construction of 
A-maps. 
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